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1.   INTRODUCTION 
The  binomial  distribution  function, 

f 

is  difficult  to  tabulate  directly  because  the  tabulation  involves  three 
variables:   p,  n,  and  s.  Abraham  DeMoivre  (1730)  was  the  first  to 
consider  the  problem  of  approximating  the  binomial  probabilities  using  the 
normal  distribution.  The  Laplace-DeMoivre  result  involves  term  by  term 
normal  approximation  to  the  binomial  probabilities.  A  proof  for  the 
DeMoivre-Laplace  Limit  is  given  by  Feller  (1950). 

Today  there  are  many  different  approximations  for  the  binomial,  but  the 
accompanying  problem  appears  to  be  that  of  determining  the  error  involved 
when  each  approximation  is  used.   Five  different  approximations  are  pre- 
sented in  this  paper.  Peizer  and  Pratt  (1966)  developed  a  new  normal 
approximation,  z,      or  the  more  refined  z_,  which  appears  to  give  a 
smaller  error  than  Camp's  (1951)  cube  root,  Pinkham's  (1957)  square  root, 
or  the  usual  normal  approximation.   The  Poisson  Gram-Charlier  approximation 
performs  best  if  it  is  used  only  if  n  ^5  and   np  ^  0.8   (Raff,  1956). 
Finally  Govindarajulu' s  (1^63)  approximation  can  be  made  as  accurate  as  one 
desires  because  the  error  term  can  be  made  as  small  as  desired. 

Let  P  be  a  binomial  tail  probability.  Then  define  0  to  be  the 
standard  normal  left  or  right  tail  function  according  as  P  is  a  left  or 
right  tail  probability.  A  value  z  is  sought  such  that 

2 


jj(z)   =  J      (2-0- )~^  e  -^'^   dx 


is  approximately  equal  to  P  if  P  is  a  left  tail  probability 
or 

2 
z 

is  approximately  equal  to  P  if  P  is  a  right  tail  probability. 

Let  the  parameters  of  the  binomial  be  n  and  p.   Let  s   be  the  number  of 

successes.   Then,  if  P  is  a  right  tail  probability,  let 

S  =   s  -  ,5 
and 

T  =   n  -  S 

The  following  example  will  be  given  at  the  end  of  each  section  with 
the  exception  of  the  section  on  Govindarajulu' s  approximation,  where  the 
calculations  are  difficult  and  the  error  can  be  made  as  small  as  one  wishes 
by  continuing  the  expansion  to  as  many  terms  as  needed. 

What  is  the  approximate  probability  for  4  or  more  successes  under  the 
binomial  distribution  with  parameters   n  =  5  and  p  =  .25? 

The  probability  can  be  computed  exactly  by  evaluating  the  following     , 
terms  of  the  binomial  expansion: 

(^)  (.25)^  (.75)^  +  (^)  (.25)^  (.75)°  =   .015625 

Before  presenting  the  five  approximations  considered  in  this  paper,  the 

example  is  computed  using   z  =  ^  ^nd   z  =  ^  .   The  reason  for 

(npq)^  (npq) 

looking  at  these  two  ordinary  approximations  is  to  show  how  they  compare  with 
the  approximations  presented  in  this  paper. 


'    s  -  np      ,  ^   J     •  _  ^  -  ^'25  _  ...2.75  _  _  „ /-- 
The  case  for  z  =  T  results  in  z  =         -      q^q,  -  2.BWJ 

(npq)  Y.yjyj 

because  np  =  5(.25)  =  1.25  and  npq  =  5(.25)  (.75)  =  .9375. 
The  desired  approximation  to  the  right  tail  probability  is 

0^(z' )  =   .0023  . 
2  o 

The  second  approximation  z  =  — ^— ^  involves  replacing  s  with  S 

(npq)^ 

where  S=s-.5  or  S=A-.5=3.5.  Thus 

S  -  np  _  3.50  -  1.25  ^   2.25  _   „  -239 
(npq)  V  .9375 

The  desired  approximation  to  the  right  tail  probability  is 

0„(z  )  =   .0101  , 
20 


2.   PEIZER  AND  PRATT  APPROXIMATION 

In  order  to  find  the  probability  P,  the  function  0j^(z),  or  ^^^z)t 
may  be  used;  but  then  the  proper  value  of  z  still  remains  undetermined. 
The  unknown  value  of  z,  which  depends  on  the  values  of  n,  p,  and  s, 
may  be  estimated  several  different  ways.  The  following  method  presents  z. 
and  z„,  two  estimates  of  the  unknown  z.  The  quantities  z  ,  z^,  and 
z,  are  introduced  to  show  the  different  stages  in  the  development  of  z. 
and  z  . 

Peizer  and  Pratt  (1966)  considered 

(2.1)     z,   =  r4r7 ^ r  <S  In  ^+  T  In  ^)(   ;     i  =  1,  2, 

or  for  computational  purposes, 


(2.2) 


as  their  approximation  for  the  unknown  quantity  z,   where  d   has  the 
value  .       .    ' 

d^   =   S  +  ^  -  (n  +  J)  p  , 
or 

and  g  can  be  obtained  either  from  a  table  or  by  direct  calculation  as 
explained  later. 

They  started  their  search  for  a  normal  approximation  to  the  binomial 

2 
probabilities  by  looking  at   z  =  (S  -  np)/cr-   where  CT      =  npq,   and 

stated  that  one  reason  why  z   is  inaccurate  is  that  it  misplaces  the 
median.   The  median  and  the  mean  in  a  normal  distribution  are  the  same,  but 
this  is  not  the  case  in  the  binomial  distribution  unless  p  =  ^.   Therefore 
using  z  ,   one  estimates  the  mean  accurately,  but  not  the  median.   After 
studying  the  median  numerically  and  asympototically,  they  found  the  follow- 
ing modification: 

(2.3)     z^  =  d^/cr   =   [S  -  np  +  (q  -  p)/6]/o- 

The  modification,   z^,   gives  better  accuracy  than   z   near   z  =  0.   How- 
ever,  z~   is  not  necessarily  as  accurate  as   z   when  the  value  of  z  is 
■i  o 

not  near  zero. 

The  next  step  is  to  try  to  find  an  approximation  for  z  that  gives 
greater  accuracy  than  z   in  the  tails  of  the  distribution  as  well  as  around 


z  =  0.  To  accomplish  this,  they  chose  to  observe  the  graph  of  z/z-.  This 
relationship  appears  to  be  approximately 


(2.4)     —  =  f(p,D) 
^3 


2 


z 


when  D  =  (S  -  np)/ cr  =  -^p  and  f  is  a  gently  varying  function.   Other 
arguments  could  be  used,  for  example,   p  and   S/np.   However,   D  shows  the 
symmetry  aspect  of  the  situation  better. 

In  order  to  tabulate   f,   and  thus  use  z-f  as  an  approximation  to  z, 
it  is  desirable  to  make  z/z,  more  nearly  a  function  of  p  and   D  alone 
rather  than  to  make  z^  closer  to  z.   An  asymptotic  calculation  of  the 
central  temn  or  terms  of  the  binomial  distributions  leads  to  replacing  n 
by  "  ■*■  r  in  the  denominator  of  z^.   The  same  result  could  be  found  by 


examining  the  graph  of  z/z^  for  p  =  k     (Pratt,  1966). 

1  +  q  g  (rr)  +  p  g  (rr) 


Since  z,  =  d,  S  '^. ^ 

'  (n  +  T)  pq 


it  can  be  seen  that  z.   can  be  expressed  as 


J- 


(2.5)     z^f(p,D)   where  z^  =  d^^  [(n  +  ^)  pq]    and 


f   =   [l  ^  q  g  (f;)  ^  P  g  (^)] 


By  substituting  the  value  of  d.   given  in  (2.3),  z,      may  be  written  as 
(S  +  c^  -  np)/(n  +  c^)  pq 

for  c.  =  (q  -  p)/6  and  c^   =  -.   Other  choices  for  c   and  c   lead  to 
no  better  approximation  (Pratt,  1966). 

To  make  the  approximation  accurate  in  the  extreme  tails,  define  f  to 
be  the  limit  of  z/z-   as  tT—^o^    for  p  and  D  fixed.  Thus  one  lets 


n  — »  o^   and  looks  at  the  limit  as  the  number  of  success  either  gets  larger 
or  smaller  for  fixed  p.   These  results  can  be  obtained  from  results  of 
Bahadur  (1960).   This  makes  the  relative  error  in  the  approximation  to   z 
by   z.  =  z,  f  approach  0  as  0~— »  e^  for  fixed   D  ^  0.   Using  this,   f 
can  be  expressed  in  terms  of  familiar  functions,  in  fact,  in  terms  of  a 
function  of  one  variable; 

(2.6)     f  =   G^  =   [d  +  q  g  (|^)  +  P  g  (—)] 

where  the  function  g  is  defined  as 

2 

/  \     1  -  x  •«■  2x  Inx         rt      J  , 
g(x)   = ,    x>  0,    X  t   I 

(1  -  x)"^ 

g(0)  =  1,    g(l)  =0, 

g(x)  =  -g(x   ) 

Peizer  and  Pratt  (1966)  have  tabulated  the  function  g(x). 

The  function  G  can  be  expressed  with  arguments  p  and  D,   which  is 

the  form  specified,  by  using  the  relationship  D  =  — "^  . 

S 
The  relationship  is  solved  for  —  in  the  following  manner: 

_   S  -  np  _    S    np   _    S    1. 
"Pq       npq   npq     npq   q 

or 

qD  =   —  -  1  ;  —  =   1  +  qD  . 
np      '  np         ^ 

T 
By  using  the  same  technique,  one  obtains  — : 

nq 

D  =   S  -  np  ^  _S_  _  1   ^   n  -  T  _i  ^   1   _  T  _  1 
npq       npq   q      npq    q     pq   npq  ~  q 

or 


T 
npq 

pq     q 

T 
nq 

q     q     '^ 

^^  "  P^  -  pD  =   1  -  pD  . 

q 

s  T 

When,   1  +  qD  is  substituted  for  —  and  1  -  pD,   for  ~  , 

G  =   1  +  p  g(l  -  pD)  +  q  g(l  +  qD)  . 

In  order  to  give  greater  accuracy  near  the  median,  Peizer  and  Pratt 
introduced  z  ,  a  modification  of  their  z. . 

The  relationship  between  z,   and  z.  can  be  seen  below. 

h 


Let  z^  =  (dj^  +  c^)  •) ^ (    where  c^  =  0(0"  "")  . 


For  i  =  1,   let  c  =  0  . 


For  i  =  2.   let  c^  =  .02  ^jt~:j  -  YTZ 


a _2 +  q  -  '5 


5   n  +  1 


) 


The  example  presented  in  the  introduction  is  calculated  for  z.   and 


^^ 


z   as  follows: 

fl  +  q  g(— )  +  p  q(— ) 
,  T,    .        ,    .            ,    J  J    ^  °  np    '^   np 
Peizer  and  Pratt's  approximation  z,   equals  d.  1 ^j 

'  (.  (n  +  ^)   pq 

where     T  =   n  -  S     or     5  -  3.3  >  1.5,   ^  -  ^^  =  2.8,   ^  =  j^^  =    .4   . 

—  /  —  =  ■^  =   7,   d,    =   S  +  2-  -   (n  +  ^)p  =   3.5  +  ^  -   (5  +  ^)    (.25)   =   2.3333    , 
npnq.Al  6  3*^  6  3 

and  g  is  obtained  from  Peizer  and  Pratt's  (1966)  tabulation  of  the  func- 
tion g  where  g(2.8)  =  -g(r^)  =-.3315  and  g(.4)  =  .2971. 
These  quantities  substituted  in  (2.2)  give 


=     2.3333  r   -  ^'''^    ^3315M    .25^297U 


h 


(5  +  ^)    (.25)    (.75) 


=     2.3333  4     Jz.lA       =     2.3333(.92319)     =     2.1541    . 


-"3{tfgff] 


The  desired  approximation  to  the  right  tail  probability  is 
02^^!^  "      .01562  . 

Peizer  and  Pratt's  approximation  z-  equals  d  J "^r 

L     Cn  +  ^)pq 

is  a  refinement  of  their  z,   where  d-  =  d,  +  .02  V  r-f — r  -  z~ — r  +   ~  ,  f 

1  2     1       (_S  +.5    T+.5    n+lj 

=  2.3333  +  .02  (-  'l^   ,  -  ■  ^'f^  ,  +  r*r^)  =  2.3333  +  .0021  =  2.3354  . 
3.5  +  .5   1.5  +  .5   5+1 

Thus  z^   =  2.3354  (.92319)  =  2.1560  . 

The  desired  approximation  to  the  right  tail  probability  is 

02^22^  "   .01554  . 

In  this  example  z.   was  better  than  z.,   but  usually  z-  is  better. 
Peizer  and  Pratt  (1966)  have  graphed  the  relative  error 

P  -  0(z„) 
(2.7)    €    =  t ^ 


min  [jHz^),    1  -  ^(z^)  J 

of  0  (z^)   against  Sq/Tp  for  many  values  of  S,   n,   and  p. 
By  solving  (2.7)  for  P, 

P  =   0(Z2)  t    £  mint0(z2),  1  -  0(z2)]   , 

the  approximation  0  (z„)  can  be  corrected  by  using  the  value  of  C  found 
in  the  appropriate  graph. 


For  z_   in  the  example,  the  relative  error,  G.  ,   is  -.005.   Then  the 
corrected  probability  is 

P  =   .01554  -  (-.005)  (.01554)  =   .01526  . 


3.   CUBE  ROOT  APPROXIMATION 


Camp  (1951)  gives  the  following  approximation  to  the  sura  of  the  first 
(t  +  1)   terras  of  the  binomial  probability  function  where  t  equals  the 

number  of  failures:   Let  the  binomial  probability  function  be  generated  by 

-1  "   -t  t 

the  terms  of   (p+q)  .   Let  the  sum  =  S   .  =p  +np   q+ +(t)p   q. 

Then  S     is  approximately  equal  to  the  probability  that  a  unit  normal 


deviate  will  exceed   z 


1/3 


where 


(3.1) 


i  r(^^^)  (-f-r  h 
3  L    s     t  +  1  p 


1/3 


9t  +  8 


t  +   U 


'1/3 


r,  2/3    ,  -]   1/: 

Ls   t  +  1  p^        t  +  ij 


s  =  n  -  t 


In  an  attempt  to  standardize  the  notation.  Camp's  approximation  is 
re-written  in  terms  of  the  notation  used  by  Peizer  and  Pratt. 

Let  s  =  b  =  S  +  ^  and  t+l=a=T+%. 


Then 


(3.2) 


1/3 


[ig  -  i)  (^)    -  9  + 


'1/3 


_a2_ 


3 


r.      .      2/3   .-]   1/: 
[_b  ap      a  / 


Camp  (1951)  gives  the  following  example  which  shows  errors  of  approxima- 
tion for  the  special  case;   n  =  8,   p  =  0.80: 


10 


( 

.8 

+  .2r 

t 

\'   ' 

Error 

Approx. 

True 

0 

.166 

.168 

-   .002 

1 

.505 

.503 

.002 

2 

.801 

.797 

.004 

3 

.943 

.944 

-   .001 

5 

.999 

.999 

.000 

While  an  error  of  .005  appears  to  be  small,  when  P  is  actually  .01,  the 
relative  error  would  be  50  per  cent. 

The  cube  root  approximation  (Pratt,  1966)  belongs  to  the  family  of 
approximations  coming  from  root  transformations  of  chi  square  random  vari- 
ables.  It  will  be  shown  that  one  can  go  from  the  chi  square  to  the  F 

distribution,  to  the  incomplete  beta,  and  hence  to  the  binomial  distribu- 

2 
tion.   Let  X   have  a  chi  square  distribution  with  v  degrees  of  freedom. 

Then  treat   (X  /v)""  (m  >0)   as  normally  distributed  by  the  Central  Limit 


Theorem.   That  is 

^2,..^m   „r,„2,  .m-, 


(3.3) 


(XVy)*"  -  e[(XVv)"'1 


V 


N(0,  1)  . 


Var  (X^/v)" 


One  can  find  the  mean  and  variance  plus  the  third  central  moment  and 
the  median  since  Wilson  and  Hilferty  (1931)  have  computed  these  quantities 
for   (X^/v)™. 


fn    /  \  ujj        1    2ra^  2m  ,     29.  . 

(3.4)     Median  =   1  -  —  +  — -  (m  -  — )  + 

3v   „  2      45 
9v 


(3.5)     E(X^/v)'°  =   1  +  -  (m  -  1)  +  ^  (m 

^  3v 


-  1)  (m  -  2)  + 


u 


2 

(3.6)  Var   (X^/v)"  =  ~  +  ^  (m  -  1)  (3m  -  1 )  +  . . . 

V 

(3.7)  Third  central  moment  of   (X'^/v)™  =  —  (3m  -  1)  +  ... 

V 

Then  one  can  estimate  the  probability  of  the  chi  square  random  variable 
in  the  following  way: 

,3.3,    ^  ^    p^X^/v)--E[(xV^^^^^ 


/"var  (X^/v)™ 


c(  ^   P[(x2/v)'"5:  y;  =  E[(x2/v)"']  +   /vITu^/v)"   z^  ] 

Since  the  mean  and  variance  can  be  found,  one  can  solve  for  z^  and  then 
use  the  normal  probability  tables  to  obtain  an  approximate  value  for  o(    , 

Because  the  F  distribution  is  a  ratio  of  two  independent  chi  squares 
divided  by  their  degrees  of  freedom,  one  can  extend  the  previous  discussion 
to  obtain  a  normal  approximation  for  the  F  distribution. 


2        2 
Let  X^  and  X^  be  two  independent  chi  square  random  variables. 


Then 


w  ^  0 


The  first  quality  follows  from  the  fact  that  chi  square  has  only  positive 
values. 

The  quantity   (X./v. )   -  w  (X./v.)   approaches  normality  because  a 


12 


linear  combination  of  independent  normal  variables  is  normal  with  mean  of 
the  difference  between  the  means  of   (X  /v  )    and  ^^■/'^2^        '""'^  variance 
of  the  sum  of  the  variances  of  the  previous  quantities.   Thus  the  normal 
approximation  can  be  applied,  leading  to  a  family  of  normal  approximations 
for  the  F  distribution. 

Wilks  (1962)  gives  a  transformation  for  going  to  an  incomplete  beta 
from  the   F  distribution.   Therefore  one  can  obtain  a  normal  approximation 
for  the  beta  distribution.   Feller  (1950)  shows  that  the  binomial  can  be 
obtained  from  the  incomplete  beta.   In  like  manner,  one  can  obtain  a  normal 
approximation  for  the  binomial. 

Since  one  wants   (X  /v)   to  approach  normality  as  fast  as  possible, 
one  chooses  a  value  of  m   that  will  make  the  distribution  as  symmetric  as 
possible.   Wilson  and  Hilferty  (1931)  picked  ra  =  7.   For  this  value  the 

leading  term  of  the  third  central  moment,  or  the  measure  of  skewness,  drops 

2   1/3 
out.   Then   (X  /v)     is  treated  as  a  normal  variate  with  mean   (1  -  2/9v) 

and  variance  of  2/9v,   using  only  the  first  two  terms  of  the  expansion  of 

the  mean  and  variance.   The  median  and  the  mean  agree  through  the  terms  of 

order  —  when  m  =  — ,   This  is  only  true  when  m  has  this  value. 

V  3  ^ 

Paulson  (1942)  used  the  method  previously  described  in  obtaining  a 
cube  root  approximation  for  the   F  distribution  from  the  Wil son-Hi  If erty 
cube  root  approximation.   Camp  (1931)  extended  Paulson's  work  to  obtain  an 
approximation  to  the  incomplete  beta.   Thus  he  was  able  to  arrive  at  a 
normal  approximation  to  the  binomial  distribution. 

The  example  presented  in  the  introduction  is  calculated  using  Camp's 
cube  root  approximation  z'  _. 
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z 


I-   -  b    ^'f/'   -  ^  -  il     [--b^^'"-^^^ 


1/2 


35    ^/^   12 

4   ^^^    ~  2     ^   (8.75)  (1.82)  -  8.5  ^   2.1484 


fl    2/3    1  1/2      r  ^ 

3[t  (6)    .  ^J        3[^  (3.31)  .  - 


1/2 


because  b  =  S  +  .5  =  3.5  +  .5=4  and  a  =  T  +  .5  =  1.5  +  .5  =  2  . 
The  desired  approximation  to  the  right  tail  probability  is 

02<z[/3)  =   -0159  . 

4.   SQUARE  ROOT  APPROXIMATION 

The  unknown  value  z  can  be  estimated  by  using  the  square  root 
approximation  z,   suggested  by  Pinkhara  (1957),  where 

(4.1)     z'   =  [(4b  -  l)q]^  -  [(4a  -  l)p]^ 

and  as  in  the  previous  section, 

a  =  T  +  ^ 
and 

b  =  S  +  %  .     . 

The  same  argument  is  used  here  as  was  used  in  the  cube  root  approxima- 
tion. 

2  k 
Choose  m  =  %.   Then  treat   (X  )   as  a  normally  distributed  variate 

%  1 

with  mean  v   and  variance  r.  This  leads  to  Freeman  and  Tukey's  (1949, 

1950)  approximation;  ' 
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(4.2)      z^     =   2L(bq)^  -  (ap)^] 

Pinkham  (1957)  studied  the  first  four  moments  of  a  transformed  beta 
variable  and  decided  to  modify  zV  with  Fisher's  modification  of  the  mean 
leading  to   z,  . 

Although  z,   is  not  as  accurate  as   z, /o*  it  is  useful  for  variance 
stabilization  according  to  Pratt  (1966).   The  variance   (X  )    is  asymptot- 
ically constant  only  for  m  =  ^. 

The  example  using  Pinkham' s  square  root  approximation  Zi  = 

[(4b  -  l)q]^  -  [(4a  -  l)p]^  results  in  [(4(4)  -  1).75]^  -  [(4(2)  -  1).25]^ 

=  [(15)  (.75)]^  -  [7(.25)]^  =   [11.25]^  -  [1.75]^  I 

=  3.3541  -  1.3229  =   2.0312  | 

The  desired  approximation  to  the  right  tail  probability  is 

5.   THE  ABSOLUTE  AND  RELATIVE  ERROR  OF  ^q*  z^*  ^2'  ^1/3'  ^1/2 


Peizer  and  Pratt  (1966)  have  graphed  the  ratio  of 

true  left  tail     approx  right  tail  |a  ^^^  ^^^   approximations 

approx.  left  tail    true  right  tail     ^       Tp  ^'^ 

previously  mentioned  with  the  exception  of  z  .   The  ratio 

true  left  tail     approx.  right  tail      ,  ,    . .     ^   r  \._ 

; — — •  X  — ^ r~rr ~, —  "lay  be  expressed  as  the  rate  of  true 

approx.  left  tail    true  right  tail 

and  approximate  P/l-P  where  P  +  1-P  equals  the  total  area.   These 

graphs  appear  to  show  that   z„  is  the  best  approximation  followed  by  z.  , 

the  cube  root,  the  square  root,  and  last,  the  ordinary  normal  approximation. 

At  a  fixed  value  of  Sq/Tp,  other  than  one,  with  S  increasing,   z 
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and  z   are  the  only  approximations  that  appear  to  remain  relatively 
accurate. 

Pratt  (1966)  confirmed  the  conclusion  drawn  from  the  graphs  in  the 

previous  paragraph  by  asymptotic  theory. 

2 
He  considered  the  case  where  0"  — d  c^  ,   z  is  bounded,  and  p,   S, 

and  T  are  varying.   Then  z  is  not  equal  to  -  .«»o  or  +  o^o   ,  i.e.,  P 

is  bounded  away  from  0  and  1.   Then  the  order  of  magnitude  of  error  in  P 

for  the  different  approximations  is  as  follows: 

h   °''   ^2 ?T 

.      _1_ 

^1/3  "    ~  0-2 

z   or  z'  ,mi^l/3 "Tpr 

o      m  t' 

3 
For  z,   this  means  that  P  -  0(z)  =  Od/CT  )  and  only  in  special 

cases  is  this  error  of  smaller  order.   For  a  review  of  the  notation  o(*) 

and  0(«)  see  Lukacs  (I960).  The  exception  in  the  binomial  case  is  when 

p  =  ^  and  then  the  error  is  — r  ,  — r  ,  — -     respectively. 

2 
Now  consider  CT  — » o*  and   /z/ -*  o^  as  p,   S,  and  T  vary,   ^fow 

lzj-»<^       but  z  =  o(  a~),   i.e.   P  approaches  0  or  1  and 

D  =  z  /O"   =  — "  "P-^  0  .   Then  the  order  of  magnitude  of  error  in  P  is 
o       npq 

z^  or  z^ -^  D 

^1/3       ^°^ 

z   or  z  .mj'l/S zD  . 

o       m  • 


16 


In  each  approximation  the  error  in  z  is  of  smaller  order  than  z 
itself,  but  only  for  z.      or  z       does  the  error  always  approach  0  as 
a"  — ?  -cyo    . 

The  relative  error  is  defined  as  either  (true  -  approx.)  /  true,  or 
(true  -  approx.)  /  approx.  since  the  error  in  z  is  of  smaller  order  than 

z  itself  and  therefore  of  the  same  order  whichever  Is  used. 

Order  of  magnitude  of  error  relative  in  z  is  as  follows: 

z^  or  z^ —  D 

^1/3      ^' 

z   or  z  ,m^l/3 D 

o      m 

Next  look  at  the  tail,  probability  P  =  0(z)  where  P  is  close  to  0 
or  1  since   /z/ — *  oo  .   The  In  (min  P,  1-P)   is  considered  because 
as  P-^  0,   the  error  in  P  is  hard  to  interpret  except  as  relative  to  the 
true  or  approximate  P.   However,  these  errors  are  of  different  orders  of 
magnitude.  The  error  when  considered  as  error  in  In  P  avoids  this 
problem.   For   /z/ —a  •o"   ,  one  uses  error  in  In  P  when  P  is  near  0 
and  error  in  In  (1  -  P)  when  P  is  near  1.   The  order  of  magnitude  of 
errors  in  In  (min  [P,  1  -  P]}  for  the  different  approximations  is  as 
follows: 

3 
^1  °^     ^2 ° 

^1/3 ^  ° 

•  2  '      • 

z   or  z  ,m?«l/3 z  D 

o       m 
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The  relative  error  in  min  (P,  1-P)   will  approach  0  if 

z  =  o(C7-)   when  z,   or  z   is  used. 

1/2 
z  =  o(  O"    )   when  zj/o   ^s  used. 

z  =  o(0~   )   when  z   or  z' ,mTi'l/3   is  used. 

o       ni 

Both  z,   and  z-  are  nearer  to  standard  normal  than  z   in  the 
extreme  tails.   They  are  a  strictly  decreasing  function  of  p  =  1-q  and 
approach  +  o^   at  p  =  0  and  -  c»o  at  p  =  1   for  fixed   S  and  T. 

The  root  transformations  are  strictly  decreasing,  but  do  not  approach 
*  c^     at  p  =  0  and  -  «»«  at  p  =  1.   Instead,  they  have  the  range 

"  o't  ^  ^™  "^   '^S      '.      . 
provided  >u  ,  ju^,  CTg,  CT^  are  positive  (Pr'att,  1966). 

6.   POISSON  GRAM-CHARLIER  APPROXIMATION 

The  Poisson  Grara-Charlier  approximation  is  based  on  the  fact  that  the 
binomial  distribution  can  be  approximated  by  the  Poisson  distribution  when 
p  is  small,  and  is  defined  by  Raff  (1936)  as  follows: 

B(8;n,p)  =  P(s,  np)  +  %p(s  -  np)AP(s,  np) 

where  s  has  integral  values  from  0   to  n  inclusive 

p(s,  np)    =    e-"P   Z  ^  . 

i=0 
and 
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AP(s,  np)  =  e   *'  ^  . 

This  approximation  is  the  first  two  terras  of  what  Rietz  (1927)  calls  a 
Grara-Charlier  Type  B  series.   The  remaining  terms  may  be  left  out  since  the 
series  converges  very  rapidly.   This  series  is  an  extension  of  the  Poisson 
exponential  approximation  and  the  development  can  be  found  in  Rietz  (1927). 

Raff  (1956)  discusses  maximum  error,  the  largest  possible  error  which 
can  arise  in  estimating  any  sum  of  consective  binomial  terms  with  the  speci- 
fied parameters.   In  this  approximation  the  maximum  error  decreases  as  p 
becomes  smaller.   The  size  of  n  does  not  affect  the  decrease  for  the 
approximation  has  the  property  that  its  maximum  errors  are  practically 
independent  of  n.   The  maximum  error  can  be  kept  below  .005  if  the  approxi- 
mation is  used  only  if  n  ^5  and  np  ^0.8.   He  also  states  that  the  same 
maximum  error  can  be  had  with  Camp's  approximation  if  it  is  used  only  if 
n   5  and  np   0.8.   While  an  error  of  .005  appears  to  be  small,  when  P 
is  actually  .01,  the  relative  error  would  be  50  per  cent. 

The  example  presented  in  the  introduction  does  not  fit  the  criterion 
for  using  the  Poisson  Gram-Charlier  approximation,  but  the  problem  will  be 
presented  for  comparison  purposes.   The  procedure  will  be  to  find  the 
probability  of  having  0,  1,  2,  or  3  successes  and  then  to  subtract  this 
quantity  from  1. 

The  Poisson  Gram-Charlier  approximation 

B(s;n,p)   =  P(s,np)  +  ^p(s-np)AP(s,np) 
where  s  goes  from  0  to  3,   p  =  t,   np  =  r  . 
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5,i 


si       5  3   (J) 


The  quantity  P(s,  np)  =  e"""^  ^     jf"  =  C  ^  ^   n 

i=0  ^'  i=0 

=   .286503  {l  +  f  +  ff  +  5^  =   -286505  [3.3568]  =  .96174  . 


-np  Si 


s 


The  next  step  is  to  calculate  '^p(s-np)AP(s,np)  =  ^pCs-np)  £    ^j 

5  (^)^ 
=  i  (i)  (3  -f)e-4^  =  ^  (J)  (.286503)  (^g)  =   .02040  . 

Therefore  BCsin.p)  =   .96174  +  .02040  =   .98214  . 

The  desired  probability  of  either  4  or  5  successes  out  of  5  trials  is 

1  -  .98214  =  .01786  . 

7.   GOVINDARAJULU  APPROXIMATION 

Govindarajulu  uses  the  fact  that  the  binomial  random  variable  S^  can 
be  expressed  as  a  sum  of  n  independently  identically  distributed 
Bernoulli  random  variables  which  take  the  values  1  and  0  with  proba- 
bilities p  and  q  respectively  to  develop  his  approximation.   The 
binomial  random  variable  can  also  be  expressed  as  a  lattice  random  variable 
of  the  form  c  +  sh  where  c  =  0,  h  =  1,  and  s  =  0,  1,  ...   By  putting 
these  two  results  together,  Esseen's  theorem  (1945)  can  be  used  to  find  the 
probability  that  the  number  of  successes  equals  some  s  where  8=0  to 
n. 

Let   f  (s)  =  P(S  =  s)  =   (")  p*  q""® 

n  n  s  -  > 

_      (s  -  np) 
(npq)^ 

aj^^x)  =  ^—  0(x)     8  =  1,  2,  ... 
1         ,  s 
dx 
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where     0     denotes   the  standard  normal  density   function 
Then 


ys)      =      (npq)-^    [0(X)    -  -^^  0^^\x)    +   ^  [(l-6pq)0' 

/iarEi!  0(6)^^^    _        ^q-p)  yU-12pq)    ^(5)^^j   ^   H^  ^C?)^^) 

^  J       24(npq)^^^i-  ^ 

^   ^q-P)  0^^\x)h  +  o(n"^)   as  given  by  Govindarajulu  (1963). 

To  show  this,  consider  the  following  theorem  (Esseen,  1945)  with  8=5 

and  X.   as  a  Bernoulli  random  variable  with  q  =  0  and   p  =  1: 
J 
Let  X  ,  X  X   be  independent  and  identically  distributed 

lattice  random  variables  taking  values  of  the  form  c  +  sh,  s  =  0,  +  1,  ...« 

where  h  is  the  maximum  span  of  the  distribution.   Let   ^^  ~  ^i  ^  ^2  *  *** 

+  X  .  Also,  let  ja  and  a— respectively  denote  the  mean  and  standard  devia- 
n 

tion  of  X  .   If  X   has  finite  absolute  moment  of  the  order  sCs  ?- 3) 
i        i 

inclusive,  then  for  all  k 


s-2  -^  1 

-\f0(x)  +  Z  n  ^  P.(-«(x))  \ 
•nU       j=l      J       -* 


P(S  =  nc  +  hs)  =  — \^0(x)  +  Z  n  '  P.(-«(x))  \  +  o(n'  ®"     ) 


where 


X  =   (nc  +  hs  -  nu)/o"n   and  P.   is  a  polynomial  as  defined 

below. 

If  F  (x)   is  the  cumulative  distribution  function  of  a  standardized 
n 

sura  of  n  independent  and  identically  distributed  random  variables  having 
variance  <r  and  cumulants  j  ^  o"*^  I  v  ^  3  ^  and  9  is  the  standard 
normal  c.d.f.  then  . 
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*°   -1/2 
F  (x)  =  0(x)  +  i.  n  •''   P,(-«(x)) 

j=l        ^ 

where  P,(-w)   is  a  polynomial  of  degree  3j   with  coefficients  depending  on 
the  cumulants  of  the  component  random  variables  of  orders  3   through  j  +  2. 
They  are  defined  by 

*^        -i/2 
exp  t-  I.      *.-w;'   n  -   \  =   1  +  Z  P.(-w)n 


[,!,  TFirf  '-">^"  "■^"]  = 


1   J 


where  P.(-0)   is  calculated  by  replacing  w   by  0(r)   in  P  (-w), 

B^'^^x)  =  ^ —  0(x)   (Edgeworth,  1956). 
dx'^ 

Let  c  =  0 

h  =  1 

s  =  0,  1,  2,  ... 

p  =  mean  of  X. 

pq  =  variance  of  X. 

Using  the  previous  conditions, 

^  ^   (n(0)  ■»•  s  -  np)   ^   ^s  -  np) 
(pq)^  (n)^         (npq)^ 

5-2    .  -)  -^^-^> 

P(S  =  8)   =  — ^  U<x)  +  ^  n"^^^  P.(-0(x))  (  +  o(n    ^   ) 


f. 


-^  U(x)  +  Z  n"J^^  P.(-0(x))[  +  o(n~^) 


npq 


Z  n"J^^  P.(-0(x)) t 
j=l        J       J 


iTi;" 


—  \0(x)  +  n~^  Pj^(-0(x)  +  n"^  P^C-flCx)) 


pq 


+  vT^'^   P^(-«(x)]  1+  o(n"^) 


[x)]j+  o(n"' 
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To  evaluate  the  polynomials,  consider  the  curaulants  of  the  point 
binomial  where  tr"^  =  pq,   and  ju  =  p.   Kendall  and  Stuart  (1958)  define 
the  cumulants  in  terms  of  moments. 

k,  =  p 

2  ... 

1^2  =  P  -  P 

k3  =  p  -  3p^  +  2p^ 

2      3     4 
\     =  P  -  7p  +  12p''  -  6p 

k   =  p  +  15p^  +  50p^  -  60p^  +  24p^ 
In  the  problem  the  cumulants  are  defined  as  k^  =   A^  ^   ^°  ^^*^ 


X3     = 

"3 
<r3 

= 

2             3 
D  -  3d     +   20"* 

(pq)f 

= 

D(l  -  3d  +   2d^) 
(pq)| 

pfd   -  2p  +   p^)   - 

(p  ■ 

■pM 

(pq)| 

= 

p[q    -  pq] 
(pq)f 

= 

pq(q  -  p) 
(pq)| 

SE 

q  -  p 
(pq)^ 
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P^(-0(x))  =  (6)'^  A3  (-0(x))^ 

=  (6)"^  a-^  (H2Kx))^ 

(pqr 

Similarily  P   and  P„  may  be  calculated 

2^(.-<iU))     =  (24)"^  A  ^  0^(x)  +  (72)'^  A  3  (3^(x) 

P,(H8(x))  =  (120)"^(-1)  A.  0^(x)  +  (144)'^(-1)  X   A  4  0^(x) 


+  6~^(-l)  A3  0^(x) 


Then 


f  (8)  =   (npq)"^  [0(x)  -  ^^-^-4  0^^\x)  + 
"  eCnpq)"* 


7  (l-6pq)  0   (x)  +  -^     -T-^—   0   Cx)^ 


24npq 


24(npq)|  I     ^ 


•^a-^^0^^\x)\]  +  o(n-2) 

The  approximation  for  the  cumulative  binomial  distribution  function  can  be 
found  by  summing  the  terms  of  f  (s). 

Because  the  Bernoulli  random  variables  have  cumulants  existing  for  all 
orders,  the  error  term  can  be  made  as  small  as  one  wishes  by  continuing  the 
expansion  to  as  many  terms  as  needed. 
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CONCLUSION 

The  result  of  Govindarajulu  approximation  is  that  the  error  term  can 
be  made  as  small  as  one  wishes  by  continuing  the  expansion  to  as  many  terms 
as  needed.   However  the  mathematics  involved  in  doing  this  becomes  more 
complicated  as  more  terras  are  calculated  in  order  to  reduce  the  error. 

The  Poisson  Gram-Charlier  approximation  performs  best  if  it  is  used 
only  if  n  ^  5  and  np  s  0.8.   In  this  range  the  maximum  error  can  be  kept 
below  .005,  but  with  an  error  of  .005  and  P  =  .01,   the  relative  error 
would  be  50  per  cent. 

Peizer  and  Pratt's  approximations,   z. ,  or  the  more  refined  z  , 
given  their  graph  to  calculate  g,  are  easier  to  calculate  than  Camp's  cube 
root  approximation  zI/o«  Also  it  has  been  shown  by  asymptotic  theory  that 
Peizer  and  Pratt's  approximations  have  less  error  than  Camp's.  Plnkham's 
square  root  approximation  Zi   is  easier  to  calculate  than  either  Peizer  and 
Pratt's  or  Camp's,  but  also  is  less  accurate. 

The  method  that  one  chooses  to  use  should  be  determined  by  the  amount 
of  accuracy  needed  and  the  time  that  can  be  allotted. 
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The  binomial  distribution  is  hard  to  tabulate  directly  since  the  tabu- 
lation involves  three  parameters.   There  are  many  different  approximations 
for  the  binomial,  but  the  problem  of  finding  a  satisfactory  approximation 
appears  to  be  that  of  determining  the  error  involved  when  each  is  used. 
Five  different  approximations  are  presented  in  this  paper. 

Let  P  be  a  binomial  tail  probability.   Then  define  0   to  be  the 
standard  normal  left  or  right  tail  function  according  as  P  is  a  left  or 
right  tail  probability.  A  value  z  is  sought  such  that 
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P  is  a  left  tail  probability 
or 
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{24x)       e    dx  is  approximately  equal  to  P  if 


P  is  a  right  tail  probability. 

Peizer  and  Pratt  have  developed  a  new  approximation  for  estimating  z, 
z.   or  the  more  refined  z„,   which  appears  to  give  a  smaller  error  than 
Camp's  cube  root  approximation  for  z,   Pinkhara's  square  root  approximation 
for  z,  or  the  usual  normal  approximation. 

The  Poisson  Gram-Charlier  approximation  is  based  on  the  fact  that  the 
binomial  distribution  can  be  estimated  by  the  Poisson  distribution  and 
performs  best  if  it  is  used  only  if   n  *  5  and   npsO.S. 

Finally  Govindarajulu' s  approximation  is  derived  by  using  the  fact  that 
the  binomial  random  variable  S   can  be  expressed  as  a  sum  of  n  independ- 
ently identically  distributed  Bernoulli  random  variables  and  that  these 
random  variables  are  also  lattice  random  variables.   His  approximation  can 


be  made  as  accurate  as  one  desires  since  the  error  term  can  be  made  as 
small  as  desired. 


